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Response to Amendment 

This office action is in response to the amendment filed on March 10, 2008. 
Claims 1, 4-10, 12-15, 18-21 and 24-26, are pending for examination. 

Claim Rejections - 35 USC §112 

The following is a quotation of the first paragraph of 35 U.S.C. 112: 

The specification shall contain a written description of the invention, and of the manner and process of 
making and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the 
art to which it pertains, or with which it is most nearly connected, to make and use the same and shall 
set forth the best mode contemplated by the inventor of carrying out his invention. 

Claims 1, 4-10, 12-15, 18-21 and 24-26, are rejected under 35 U.S.C. 112, first 
paragraph, because the specification, while being enabling for iterative clustering [e.g., 
Please refer to Sections: 0042-0046], does not reasonably provide enablement for non- 
iterative clustering. The specification does not enable any person skilled in the art to 
which it pertains, or with which it is most nearly connected, to make and use the 
invention commensurate in scope with these claims. 

As to claims 1,15 and 21, the instant invention using a K-mean clustering 
technique to iterative clustering the claimed a set of records, which is deemed to be 
performed iteratively as specified in sections: 0003 &0015 - 0046. In addition, the 
examiner disagrees with applicant's arguments that section: 0015 support the non- 
iterative processing. These sections were excerpted as following: 
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[0015] The various features of the present invention and the manner of attaining 
them will be described in greater detail with reference to the following description, 
claims, and drawings, wherein reference numerals are reused , where 
appropriate, to indicate a correspondence between the referenced items, and 
wherein: 

[0016] FIG. 1 is a process flow chart illustrating a method of the record clustering 
system of the present invention; and 

[0017] FIG. 2 is a schematic illustration of an exemplary operating environment in 
which a record clustering system of the present invention can be used." 
[0042] The resulting clustering may be further refined by reducing the number of 
the clusters. For example, it may be desirable to dissolve a cluster having a small 
size, i.e., having a small number of records. This may be accomplished by 
means of the following iterative process : 
[0043] Rank the clusters by size. 
[0044] Select the smallest cluster. 

[0045] For each record of the cluster, find the one of the larger clusters that 
matches most of the significant attributes. If more than one cluster should be 
considered, either choose the largest of these clusters or use some kind of 
distance measure to find the nearest cluster. 

[0046] Repeat until the desired number of clusters has been reached or if the 
similarity of records and clusters is too small . 
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Thus, in contrary to applicant's arguments, the claimed "non-iterative" processing 
contradicted to the instant specification. As such, the instant specification does not 
enable any person skilled in the art to which it pertains, or with which it is most nearly 
connected, to make and use the invention commensurate in scope with these claims. 
Furthermore, because the claimed "non-iterative" limitation mutually exclusive with the 
instant invention scope, thereby, there will have no prior art rejection for this limitation 
until the issues being resolved. 

As to claims 4-10, 12-14, 18-20 and 24-26, these claims have the same defects 
as their base claims respectively, hence, are rejected for the same reason. 

Claim Rejections - 35 USC § 103 

The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 

obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed 
or described as set forth in section 102 of this title, if the differences between the 
subject matter sought to be patented and the prior art are such that the subject 
matter as a whole would have been obvious at the time the invention was made 
to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was 
made. 

Claims 1,4-10, 12-15, 18-21 and 24-26, are rejected under 35 U.S.C. 103(a) as 
being unpatentable over U.S. Patent No. 6,012,058 issued to Fayyad et al. (hereinafter 
referred as FAYYAD), in view of U.S. Patent No. 6,636,862 issued to Lundahl et al. 
(hereinafter referred as Laundahl). 
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Claim 1: 

FAYYAD discloses a method of clustering a set of records, each of the records 
having attribute values for a set of attributes [e.g., Abstract], the method comprising: 

for each attribute of the set of attributes, determining a characteristic value for 
said each attribute, the characteristic value being one of a mean value and a median 
value of the attribute values of said attribute across the records [e.g., col. 3, lines 4-14, 
the use of K-mean Euclidean distance technique at col. 6, lines 27-63]; 

for each attribute value, determining a deviation from the characteristic value of 
said each attribute [e.g., col. 3, lines 27-36, col. 4, lines 42 - col. 5, lines 56, Fig. 4 and 
associated texts]; 

for each record, sorting the set of attributes based on deviation of the attribute 
values from the characteristic value of said each attribute, to provide a key; and [e.g., 
col. 8, lines 31-45, the RSDistSort(RSDist) at col. 17, Appendix B]; 

combining the set of records based on the key into a clustering results that 
includes a plurality of clusters, wherein the key comprises an ordered list of the 
set of attributes and the deviations from the characteristic value of said each 
attribute; [e.g., col. 9, lines 46-56, the use of FindMergePartner(CSIndex, CSNew) 
processing at col. 23, Appendix C]. 

FAYYAD did not specifically disclose the details of refining the clustering result 
by: identifying a cluster having a smallest number of records; for each record of the 
identified cluster, searching another cluster having records with best matching keys; and 
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distributing the cluster with the smallest number of records to the other cluster having 
records with best matching keys, to reduce the total number of clusters. 

However, Lundahl gives the details the claimed cluster refining features, 
comprising: identifying a cluster having a smallest number of records [e.g., col. 16, lines 
36 - col. 17, lines 4]; for each record of the identified cluster, searching another cluster 
having records with best matching keys [e.g., col. 15, lines 1-26]; and distributing the 
cluster with the smallest number of records to the other cluster having records with best 
matching keys, to reduce the total number of clusters [e.g., col. 15, lines 44-59, col. 22, 
lines 1-32]. 

Claim 5: 

except the limitations recited in claim 1 , the combined invention of FAYYAD and 
Lundah further discloses determining the deviation comprises calculating a difference 
between said each said attribute value and the characteristic value of the corresponding 
attribute, and dividing the difference by the characteristic value of said each said 
attribute [e.g., FAYYAD: col. 11, lines 48-56]. 

Claim 6: 

except the limitations recited in claim 1, the combined invention of FAYYAD and 
Lundah further discloses sorting the set of attributes comprises using absolute values of 
the deviations of the attribute values as a sorting criterion [e.g., FAYYAD: the 
RSDistSort(RSDist) at col. 17, Appendix B]. 
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Claim 7: 

except the limitations recited in claim 1, the combined invention of FAYYAD and 
Lundah further discloses that a first record of the set of records contains a first key and 
a second record of the set of records contains a second key; and further comprising 
placing the first key and the second key into a single cluster if the first key and the 
second key have identical sub-sequences of a first length [e.g., Abstract, lines 1-7, 
Lundah: Fig. 5 and associated texts]. 

Claim 8: 

except the limitations recited in claim 1, the combined invention of FAYYAD and 
Laundahl further discloses that first record of the set of records contains a first key and 
a second record of the set of records contains a second key; and further comprising 
placing the first key and the second key into a single cluster if the first key and the 
second key have identical sub-sequences of absolute values of the deviations [e.g., 
Lundah: col. 20, lines 6 -58]. 

Claim 9: 

except the limitations recited in claim 1, the combined invention of FAYYAD and 
Laundahl further discloses that a first record of the set of records contains a first key 
that has a first sub-sequence, and a second record has a second sub-sequence 
contains a second key; and further comprising placing the first key and the second key 
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into a single cluster if the first and second sub-sequences comprise the same set of 
attributes [e.g., Abstract, lines 1-7]. 

Claim 10: 

except the limitations recited in claim 9, the combined invention of FAYYAD and 
Laundahl further discloses that the first and second sub-sequences comprise the same 
set of attributes irrespective of a sign of the deviations of the attribute values [e.g., 
Lundah: col. 20, lines 6 -58, Note the absolute value of an attribute is irrespective of a 
sing]. 

Claim 12: 

except the limitations recited in claim 9, the combined invention of FAYYAD and 
Laundahl further discloses reducing a length of the first sub-sequence and a length of 
the second sub-sequence in order to find a best match [e.g., Lundah: col. 15, lines 44- 
59]. 

Claim 13: 

except the limitations recited in claim 1 1 , the combined invention of FAYYAD and 
Laundahl further discloses using a distance measure to find another cluster for a record 
of the identified cluster [e.g., the K-mean technique at FAYYAD: Abstract, lines 9-16]. 



Claim 14: 
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except the limitations recited in claim 1 1 , the combined invention of FAYYAD and 
Laundahl further discloses the distance measure comprises a Euclidean distance [e.g., 
FAYYAD: col. 6, lines 35-47; ]. 

Claim 15: 

This claim incorporates substantially similar subject matter as claim 1 in form of 
computer program product, hence is rejected along the same rational. 

Claim 18: 

This claim incorporates substantially similar subject matter as claim 4 in form of 
computer program product, hence is rejected along the same rational. 

Claim 19: 

This claim incorporates substantially similar subject matter as claim 5 in form of 
computer program product, hence is rejected along the same rational. 

Claim 20: 

This claim incorporates substantially similar subject matter as claim 6 in form of 
computer program product, hence is rejected along the same rational. 
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As to claims 21 and 24-26, these claims recite similar subject matter as claims 1 , 
4-10, 12-15 and 18-20 in form of an abstract computer system, hence are rejected 
along the same rational. 

FAYYAD and Lundahl are both of the same endeavor to optimizing the clustering 
of a set of records based on the characteristic value of each attributes associated with 
the record via K-mean Euclidean distance technique, thus, with the teachings of 
FAYYAD and Lundahl in front of him/her, it would have been obvious for an ordinary 
skilled person in the art at the time the invention was made to apply the detailed cluster 
refining features disclosed by Lundahl in FAYYAD's system, because by doing so, as 
suggested by Lundahl, the combined invention will use the well-known Kohonen SOM 
neural network technique to refining and distribute the smallest number of records to the 
other cluster in a best key matching mode, and thereby optimizing the system clustering 
processing. 

Claims 4: 

except the limitations recited in claim 1 , the combined invention of FAYYAD and 
Lundah further discloses determining the deviation comprises calculating a difference 
between each said attribute value and the characteristic value of each said attribute 
[e.g., FAYYAD: col. 11, lines 34-56]. 



Claim 5: 
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except the limitations recited in claim 1, the combined invention of FAYYAD and 
Lundah further discloses determining the deviation comprises calculating a difference 
between said each said attribute value and the characteristic value of the corresponding 
attribute, and dividing the difference by the characteristic value of said each said 
attribute [e.g., FAYYAD: col. 11, lines 48-56]. 

Claim 6: 

except the limitations recited in claim 1 , the combined invention of FAYYAD and 
Lundah further discloses sorting the set of attributes comprises using absolute values of 
the deviations of the attribute values as a sorting criterion [e.g., FAYYAD: the 
RSDistSort(RSDist) at col. 17, Appendix B]. 

Claim 7: 

except the limitations recited in claim 1 , the combined invention of FAYYAD and 
Lundah further discloses that a first record of the set of records contains a first key and 
a second record of the set of records contains a second key; and further comprising 
placing the first key and the second key into a single cluster if the first key and the 
second key have identical sub-sequences of a first length [e.g., Abstract, lines 1-7, 
Lundah: Fig. 5 and associated texts]. 



Claim 8: 
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except the limitations recited in claim 1, the combined invention of FAYYAD and 
Laundahl further discloses that first record of the set of records contains a first key and 
a second record of the set of records contains a second key; and further comprising 
placing the first key and the second key into a single cluster if the first key and the 
second key have identical sub-sequences of absolute values of the deviations [e.g., 
Lundah: col. 20, lines 6 -58]. 

Claim 9: 

except the limitations recited in claim 1 , the combined invention of FAYYAD and 
Laundahl further discloses that a first record of the set of records contains a first key 
that has a first sub-sequence, and a second record has a second sub-sequence 
contains a second key; and further comprising placing the first key and the second key 
into a single cluster if the first and second sub-sequences comprise the same set of 
attributes [e.g., Abstract, lines 1-7]. 

Claim 10: 

except the limitations recited in claim 9, the combined invention of FAYYAD and 
Laundahl further discloses that the first and second sub-sequences comprise the same 
set of attributes irrespective of a sign of the deviations of the attribute values [e.g., 
Lundah: col. 20, lines 6 -58, Note the absolute value of an attribute is irrespective of a 
sing]. 
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Claim 12: 

except the limitations recited in claim 9, the combined invention of FAYYAD and 
Laundahl further discloses reducing a length of the first sub-sequence and a length of 
the second sub-sequence in order to find a best match [e.g., Lundah: col. 1 5, lines 44- 
59]. 

Claim 13: 

except the limitations recited in claim 1 1 , the combined invention of FAYYAD and 
Laundahl further discloses using a distance measure to find another cluster for a record 
of the identified cluster [e.g., the K-mean technique at FAYYAD: Abstract, lines 9-16]. 

Claim 14: 

except the limitations recited in claim 1 1 , the combined invention of FAYYAD and 
Laundahl further discloses the distance measure comprises a Euclidean distance [e.g., 
FAYYAD: col. 6, lines 35-47; ]. 

Claim 15: 

This claim incorporates substantially similar subject matter as claim 1 in form of 
computer program product, hence is rejected along the same rational. 



Claim 18: 
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This claim incorporates substantially similar subject matter as claim 4 in form of 
computer program product, hence is rejected along the same rational. 

Claim 19: 

This claim incorporates substantially similar subject matter as claim 5 in form of 
computer program product, hence is rejected along the same rational. 

Claim 20: 

This claim incorporates substantially similar subject matter as claim 6 in form of 
computer program product, hence is rejected along the same rational. 

As to claims 21 and 24-26, these claims recite similar subject matter as claims 1 , 
4-10, 12-15 and 18-20 in form of an abstract computer system, hence are rejected 
along the same rational. 

Response to Arguments 

Applicant's arguments filed on March 10, 2008 have been fully considered but 
they are not persuasive. 

In response to applicant's argument under 35 U.S.C. 112, first paragraph that the 
references fail to show certain features of applicant's invention, it is noted that the 
features upon which applicant relies (i.e., performance of the clustering method requires 
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only two passes over the characteristics of variable record data and a refinement step is 
performed in a single pass only) are not recited in the rejected claim(s). Although the 
claims are interpreted in light of the specification, limitations from the specification are 
not read into the claims. See In re Van Geuns, 988 F.2d 1 181 , 26 USPQ2d 1057 (Fed. 
Cir. 1993). 

Since any other arguments made by applicant are more limiting than the claimed 
language, thus, the examiner maintains the 35 U.S.C. rejections on record. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to SUSAN Y. CHEN whose telephone number is (571)272- 
4016. The examiner can normally be reached on Monday - Friday from 7:00-4:30. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Mofiz Apu can be reached on 571-272-4080. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 



/Susan Y Chen/ 
Partial Sig. Examiner 
Art Unit 2161 
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/Apu M Mofiz/ 

Supervisory Patent Examiner, Art Unit 2161 



Application Number 



Application/Control No. 

10/700,908 

Examiner 

SUSAN Y. CHEN 



RASPL, STEFAN 
Art Unit 

2161 



U.S. Patent and Trademark Office 



Part of Paper No. 20080520 



